Dataset statistics
| Number of variables | 8 |
|---|---|
| Number of observations | 1574274 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 96.1 MiB |
| Average record size in memory | 64.0 B |
Variable types
| Numeric | 8 |
|---|
Dataset
| Description | This profiling report was generated for Python Assignment 4 |
|---|---|
| URL | https://www.assignment4.com/bitcoin/ |
| Copyright | (c) Shubham Mishra 2021 |
Timestamp is highly correlated with Open and 5 other fields | High correlation |
Open is highly correlated with Timestamp and 5 other fields | High correlation |
High is highly correlated with Timestamp and 5 other fields | High correlation |
Low is highly correlated with Timestamp and 5 other fields | High correlation |
Close is highly correlated with Timestamp and 5 other fields | High correlation |
Volume_(BTC) is highly correlated with Volume_(Currency) | High correlation |
Volume_(Currency) is highly correlated with Timestamp and 6 other fields | High correlation |
Weighted_Price is highly correlated with Timestamp and 5 other fields | High correlation |
Timestamp is highly correlated with Open and 4 other fields | High correlation |
Open is highly correlated with Timestamp and 4 other fields | High correlation |
High is highly correlated with Timestamp and 4 other fields | High correlation |
Low is highly correlated with Timestamp and 4 other fields | High correlation |
Close is highly correlated with Timestamp and 4 other fields | High correlation |
Volume_(BTC) is highly correlated with Volume_(Currency) | High correlation |
Volume_(Currency) is highly correlated with Volume_(BTC) | High correlation |
Weighted_Price is highly correlated with Timestamp and 4 other fields | High correlation |
Timestamp is highly correlated with Open and 4 other fields | High correlation |
Open is highly correlated with Timestamp and 4 other fields | High correlation |
High is highly correlated with Timestamp and 4 other fields | High correlation |
Low is highly correlated with Timestamp and 4 other fields | High correlation |
Close is highly correlated with Timestamp and 4 other fields | High correlation |
Volume_(BTC) is highly correlated with Volume_(Currency) | High correlation |
Volume_(Currency) is highly correlated with Volume_(BTC) | High correlation |
Weighted_Price is highly correlated with Timestamp and 4 other fields | High correlation |
Timestamp is highly correlated with Open and 4 other fields | High correlation |
Open is highly correlated with Timestamp and 4 other fields | High correlation |
High is highly correlated with Timestamp and 4 other fields | High correlation |
Low is highly correlated with Timestamp and 4 other fields | High correlation |
Close is highly correlated with Timestamp and 4 other fields | High correlation |
Volume_(BTC) is highly correlated with Volume_(Currency) | High correlation |
Volume_(Currency) is highly correlated with Volume_(BTC) | High correlation |
Weighted_Price is highly correlated with Timestamp and 4 other fields | High correlation |
Volume_(Currency) is highly skewed (γ1 = 23.51722215) | Skewed |
Timestamp has unique values | Unique |
Reproduction
| Analysis started | 2021-11-15 15:01:13.212300 |
|---|---|
| Analysis finished | 2021-11-15 15:02:30.057784 |
| Duration | 1 minute and 16.85 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
Timestamp
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONUNIQUE| Distinct | 1574274 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1468131457 |
| Minimum | 1417411980 |
|---|---|
| Maximum | 1515369600 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | 1417411980 |
|---|---|
| 5-th percentile | 1425636039 |
| Q1 | 1444527315 |
| median | 1468141410 |
| Q3 | 1491755505 |
| 95-th percentile | 1510646781 |
| Maximum | 1515369600 |
| Range | 97957620 |
| Interquartile range (IQR) | 47228190 |
Descriptive statistics
| Standard deviation | 27285002.82 |
|---|---|
| Coefficient of variation (CV) | 0.01858484994 |
| Kurtosis | -1.196379039 |
| Mean | 1468131457 |
| Median Absolute Deviation (MAD) | 23614110 |
| Skewness | -0.002393726514 |
| Sum | 2.311241181 × 1015 |
| Variance | 7.44471379 × 1014 |
| Monotonicity | Strictly increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1447034880 | 1 | < 0.1% |
| 1443787620 | 1 | < 0.1% |
| 1508109540 | 1 | < 0.1% |
| 1422963900 | 1 | < 0.1% |
| 1483826760 | 1 | < 0.1% |
| 1466492100 | 1 | < 0.1% |
| 1487394300 | 1 | < 0.1% |
| 1429247160 | 1 | < 0.1% |
| 1504325160 | 1 | < 0.1% |
| 1427174580 | 1 | < 0.1% |
| Other values (1574264) | 1574264 |
| Value | Count | Frequency (%) |
| 1417411980 | 1 | |
| 1417412040 | 1 | |
| 1417412100 | 1 | |
| 1417412160 | 1 | |
| 1417412220 | 1 | |
| 1417412280 | 1 | |
| 1417412340 | 1 | |
| 1417412400 | 1 | |
| 1417412460 | 1 | |
| 1417412520 | 1 |
| Value | Count | Frequency (%) |
| 1515369600 | 1 | |
| 1515369540 | 1 | |
| 1515369480 | 1 | |
| 1515369420 | 1 | |
| 1515369360 | 1 | |
| 1515369300 | 1 | |
| 1515369240 | 1 | |
| 1515369180 | 1 | |
| 1515369120 | 1 | |
| 1515369060 | 1 |
| Distinct | 270983 |
|---|---|
| Distinct (%) | 17.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1705.117812 |
| Minimum | 0.06 |
|---|---|
| Maximum | 19891.99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | 0.06 |
|---|---|
| 5-th percentile | 229.54 |
| Q1 | 290.3 |
| median | 590.05 |
| Q3 | 1224.49 |
| 95-th percentile | 7379.94 |
| Maximum | 19891.99 |
| Range | 19891.93 |
| Interquartile range (IQR) | 934.19 |
Descriptive statistics
| Standard deviation | 3059.03798 |
|---|---|
| Coefficient of variation (CV) | 1.794033209 |
| Kurtosis | 12.66161364 |
| Mean | 1705.117812 |
| Median Absolute Deviation (MAD) | 335.18 |
| Skewness | 3.442331539 |
| Sum | 2684322639 |
| Variance | 9357713.364 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 225.51 | 2557 | 0.2% |
| 378 | 2219 | 0.1% |
| 226.32 | 2188 | 0.1% |
| 224 | 1515 | 0.1% |
| 370 | 1459 | 0.1% |
| 260 | 1378 | 0.1% |
| 210 | 1095 | 0.1% |
| 189 | 1051 | 0.1% |
| 216 | 1031 | 0.1% |
| 150 | 965 | 0.1% |
| Other values (270973) | 1558816 |
| Value | Count | Frequency (%) |
| 0.06 | 1 | |
| 109.87 | 1 | |
| 109.94 | 1 | |
| 110.13 | 1 | |
| 110.2 | 1 | |
| 110.66 | 1 | |
| 110.82 | 1 | |
| 111.12 | 1 | |
| 111.37 | 1 | |
| 111.46 | 1 |
| Value | Count | Frequency (%) |
| 19891.99 | 4 | |
| 19891 | 2 | |
| 19890.99 | 1 | < 0.1% |
| 19890.68 | 1 | < 0.1% |
| 19890.5 | 4 | |
| 19890 | 4 | |
| 19889 | 2 | |
| 19888.88 | 1 | < 0.1% |
| 19888.1 | 1 | < 0.1% |
| 19888 | 1 | < 0.1% |
| Distinct | 253286 |
|---|---|
| Distinct (%) | 16.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1706.024854 |
| Minimum | 0.06 |
|---|---|
| Maximum | 19891.99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | 0.06 |
|---|---|
| 5-th percentile | 229.6 |
| Q1 | 290.41 |
| median | 590.21 |
| Q3 | 1224.81 |
| 95-th percentile | 7380.64 |
| Maximum | 19891.99 |
| Range | 19891.93 |
| Interquartile range (IQR) | 934.4 |
Descriptive statistics
| Standard deviation | 3061.434202 |
|---|---|
| Coefficient of variation (CV) | 1.79448394 |
| Kurtosis | 12.66613727 |
| Mean | 1706.024854 |
| Median Absolute Deviation (MAD) | 335.26 |
| Skewness | 3.443151746 |
| Sum | 2685750572 |
| Variance | 9372379.373 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 225.51 | 2550 | 0.2% |
| 378 | 2243 | 0.1% |
| 226.32 | 2180 | 0.1% |
| 224 | 1674 | 0.1% |
| 370 | 1513 | 0.1% |
| 260 | 1441 | 0.1% |
| 210 | 1190 | 0.1% |
| 189 | 1051 | 0.1% |
| 216 | 1026 | 0.1% |
| 150 | 965 | 0.1% |
| Other values (253276) | 1558441 |
| Value | Count | Frequency (%) |
| 0.06 | 1 | |
| 109.94 | 1 | |
| 111.89 | 1 | |
| 112.53 | 1 | |
| 115.07 | 1 | |
| 116.66 | 1 | |
| 117.31 | 1 | |
| 117.66 | 1 | |
| 117.99 | 1 | |
| 118.52 | 1 |
| Value | Count | Frequency (%) |
| 19891.99 | 5 | |
| 19891 | 3 | |
| 19890.68 | 1 | < 0.1% |
| 19890.5 | 4 | |
| 19890 | 4 | |
| 19889 | 2 | < 0.1% |
| 19888.88 | 1 | < 0.1% |
| 19888.1 | 1 | < 0.1% |
| 19888 | 2 | < 0.1% |
| 19886.12 | 1 | < 0.1% |
| Distinct | 269480 |
|---|---|
| Distinct (%) | 17.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1704.113168 |
| Minimum | 0.06 |
|---|---|
| Maximum | 19891.98 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | 0.06 |
|---|---|
| 5-th percentile | 229.48 |
| Q1 | 290.18 |
| median | 589.98 |
| Q3 | 1224.09 |
| 95-th percentile | 7375.8685 |
| Maximum | 19891.98 |
| Range | 19891.92 |
| Interquartile range (IQR) | 933.91 |
Descriptive statistics
| Standard deviation | 3056.504679 |
|---|---|
| Coefficient of variation (CV) | 1.793604285 |
| Kurtosis | 12.65734256 |
| Mean | 1704.113168 |
| Median Absolute Deviation (MAD) | 335.17 |
| Skewness | 3.441538355 |
| Sum | 2682741053 |
| Variance | 9342220.853 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 225.51 | 2552 | 0.2% |
| 378 | 2232 | 0.1% |
| 226.32 | 2182 | 0.1% |
| 224 | 1544 | 0.1% |
| 370 | 1473 | 0.1% |
| 260 | 1384 | 0.1% |
| 210 | 1098 | 0.1% |
| 189 | 1051 | 0.1% |
| 216 | 1041 | 0.1% |
| 150 | 965 | 0.1% |
| Other values (269470) | 1558752 |
| Value | Count | Frequency (%) |
| 0.06 | 1 | |
| 109.87 | 1 | |
| 109.94 | 1 | |
| 110 | 1 | |
| 110.13 | 1 | |
| 110.2 | 1 | |
| 110.48 | 1 | |
| 110.5 | 1 | |
| 110.66 | 1 | |
| 110.67 | 1 |
| Value | Count | Frequency (%) |
| 19891.98 | 3 | |
| 19890.99 | 3 | |
| 19890.68 | 1 | < 0.1% |
| 19890.49 | 4 | |
| 19889.99 | 4 | |
| 19888.99 | 2 | |
| 19888.88 | 1 | < 0.1% |
| 19888.1 | 1 | < 0.1% |
| 19887.99 | 1 | < 0.1% |
| 19885.11 | 1 | < 0.1% |
| Distinct | 271090 |
|---|---|
| Distinct (%) | 17.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1705.12341 |
| Minimum | 0.06 |
|---|---|
| Maximum | 19891.99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | 0.06 |
|---|---|
| 5-th percentile | 229.54 |
| Q1 | 290.3 |
| median | 590.02 |
| Q3 | 1224.49 |
| 95-th percentile | 7379.93 |
| Maximum | 19891.99 |
| Range | 19891.93 |
| Interquartile range (IQR) | 934.19 |
Descriptive statistics
| Standard deviation | 3059.105067 |
|---|---|
| Coefficient of variation (CV) | 1.794066663 |
| Kurtosis | 12.66189191 |
| Mean | 1705.12341 |
| Median Absolute Deviation (MAD) | 335.15 |
| Skewness | 3.442370554 |
| Sum | 2684331452 |
| Variance | 9358123.813 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 225.51 | 2532 | 0.2% |
| 378 | 2230 | 0.1% |
| 226.32 | 2180 | 0.1% |
| 224 | 1682 | 0.1% |
| 370 | 1496 | 0.1% |
| 260 | 1415 | 0.1% |
| 210 | 1191 | 0.1% |
| 189 | 1051 | 0.1% |
| 216 | 1036 | 0.1% |
| 150 | 965 | 0.1% |
| Other values (271080) | 1558496 |
| Value | Count | Frequency (%) |
| 0.06 | 1 | |
| 109.94 | 1 | |
| 110 | 1 | |
| 110.48 | 1 | |
| 110.5 | 1 | |
| 110.67 | 1 | |
| 111.31 | 1 | |
| 111.47 | 1 | |
| 111.57 | 1 | |
| 111.89 | 1 |
| Value | Count | Frequency (%) |
| 19891.99 | 1 | < 0.1% |
| 19891.98 | 3 | |
| 19891 | 1 | < 0.1% |
| 19890.99 | 2 | |
| 19890.68 | 1 | < 0.1% |
| 19890.5 | 4 | |
| 19890 | 4 | |
| 19889 | 1 | < 0.1% |
| 19888.99 | 1 | < 0.1% |
| 19888.88 | 1 | < 0.1% |
| Distinct | 969740 |
|---|---|
| Distinct (%) | 61.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.073412489 |
| Minimum | 1 × 10-8 |
|---|---|
| Maximum | 1563.267113 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | 1 × 10-8 |
|---|---|
| 5-th percentile | 0.0574 |
| Q1 | 0.6915 |
| median | 2.3815 |
| Q3 | 7.032457005 |
| 95-th percentile | 27.55502222 |
| Maximum | 1563.267113 |
| Range | 1563.267113 |
| Interquartile range (IQR) | 6.340957005 |
Descriptive statistics
| Standard deviation | 16.98568743 |
|---|---|
| Coefficient of variation (CV) | 2.401342698 |
| Kurtosis | 346.3551446 |
| Mean | 7.073412489 |
| Median Absolute Deviation (MAD) | 2.081810245 |
| Skewness | 12.65412279 |
| Sum | 11135489.37 |
| Variance | 288.5135775 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.01 | 16534 | 1.1% |
| 0.02 | 3812 | 0.2% |
| 1 | 3374 | 0.2% |
| 0.03 | 2532 | 0.2% |
| 0.1 | 2300 | 0.1% |
| 10 | 2166 | 0.1% |
| 0.05 | 1809 | 0.1% |
| 0.04 | 1447 | 0.1% |
| 0.06 | 1415 | 0.1% |
| 0.2 | 1387 | 0.1% |
| Other values (969730) | 1537498 |
| Value | Count | Frequency (%) |
| 1 × 10-8 | 27 | |
| 2 × 10-8 | 3 | < 0.1% |
| 4 × 10-8 | 1 | < 0.1% |
| 8 × 10-8 | 5 | < 0.1% |
| 9 × 10-8 | 1 | < 0.1% |
| 1 × 10-7 | 1 | < 0.1% |
| 1.1 × 10-7 | 1 | < 0.1% |
| 1.2 × 10-7 | 1 | < 0.1% |
| 1.4 × 10-7 | 1 | < 0.1% |
| 1.6 × 10-7 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 1563.267113 | 1 | |
| 1156.319405 | 1 | |
| 1086.12987 | 1 | |
| 1068.447205 | 1 | |
| 1041.413142 | 1 | |
| 931.7841041 | 1 | |
| 906.04586 | 1 | |
| 899.8325768 | 1 | |
| 845.311 | 1 | |
| 790.6707322 | 1 |
Volume_(Currency)
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWED| Distinct | 1464368 |
|---|---|
| Distinct (%) | 93.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22679.27881 |
| Minimum | 2.6417 × 10-6 |
|---|---|
| Maximum | 19970764.73 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | 2.6417 × 10-6 |
|---|---|
| 5-th percentile | 22.35765 |
| Q1 | 316.236084 |
| median | 1398.623695 |
| Q3 | 7601.787068 |
| 95-th percentile | 89921.89698 |
| Maximum | 19970764.73 |
| Range | 19970764.73 |
| Interquartile range (IQR) | 7285.550984 |
Descriptive statistics
| Standard deviation | 122515.6166 |
|---|---|
| Coefficient of variation (CV) | 5.402094907 |
| Kurtosis | 1258.573772 |
| Mean | 22679.27881 |
| Median Absolute Deviation (MAD) | 1317.047393 |
| Skewness | 23.51722215 |
| Sum | 3.570339897 × 1010 |
| Variance | 1.50100763 × 1010 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 2.2551 | 2304 | 0.1% |
| 2.2632 | 2143 | 0.1% |
| 9.8255498 | 1359 | 0.1% |
| 0.896 | 1327 | 0.1% |
| 260 | 1224 | 0.1% |
| 1.89 | 1051 | 0.1% |
| 2.16 | 1024 | 0.1% |
| 3.93 | 958 | 0.1% |
| 4.83 | 902 | 0.1% |
| 3.78 | 746 | < 0.1% |
| Other values (1464358) | 1561236 |
| Value | Count | Frequency (%) |
| 2.6417 × 10-6 | 12 | |
| 2.676 × 10-6 | 14 | |
| 4.2555 × 10-6 | 1 | < 0.1% |
| 4.578 × 10-6 | 1 | < 0.1% |
| 4.8242 × 10-6 | 1 | < 0.1% |
| 4.9064 × 10-6 | 1 | < 0.1% |
| 9.602 × 10-6 | 1 | < 0.1% |
| 2.3896 × 10-5 | 1 | < 0.1% |
| 2.934 × 10-5 | 1 | < 0.1% |
| 3.53768 × 10-5 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 19970764.73 | 1 | |
| 11983480.18 | 1 | |
| 11250613.26 | 1 | |
| 9910357.061 | 1 | |
| 9615532.16 | 1 | |
| 9480710.865 | 1 | |
| 8777059.067 | 1 | |
| 8732203.93 | 1 | |
| 8529501.608 | 1 | |
| 8437343.644 | 1 |
Weighted_Price
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 1265901 |
|---|---|
| Distinct (%) | 80.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1705.069222 |
| Minimum | 0.06 |
|---|---|
| Maximum | 19891.98753 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | 0.06 |
|---|---|
| 5-th percentile | 229.5457334 |
| Q1 | 290.303099 |
| median | 590.0206858 |
| Q3 | 1224.452841 |
| 95-th percentile | 7379.050129 |
| Maximum | 19891.98753 |
| Range | 19891.92753 |
| Interquartile range (IQR) | 934.149742 |
Descriptive statistics
| Standard deviation | 3058.975514 |
|---|---|
| Coefficient of variation (CV) | 1.794047699 |
| Kurtosis | 12.66178369 |
| Mean | 1705.069222 |
| Median Absolute Deviation (MAD) | 335.1498518 |
| Skewness | 3.442355797 |
| Sum | 2684246144 |
| Variance | 9357331.193 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 225.51 | 2451 | 0.2% |
| 378 | 2163 | 0.1% |
| 226.32 | 2151 | 0.1% |
| 370 | 1419 | 0.1% |
| 224 | 1399 | 0.1% |
| 260 | 1309 | 0.1% |
| 210 | 1095 | 0.1% |
| 189 | 1051 | 0.1% |
| 216 | 1024 | 0.1% |
| 150 | 965 | 0.1% |
| Other values (1265891) | 1559247 |
| Value | Count | Frequency (%) |
| 0.06 | 1 | |
| 109.94 | 1 | |
| 111.365 | 1 | |
| 111.89 | 1 | |
| 114.72 | 1 | |
| 115.045 | 1 | |
| 115.07 | 1 | |
| 116.565 | 1 | |
| 116.66 | 1 | |
| 116.72 | 1 |
| Value | Count | Frequency (%) |
| 19891.98753 | 1 | |
| 19891.98471 | 1 | |
| 19891.98329 | 1 | |
| 19891.27289 | 1 | |
| 19890.99936 | 1 | |
| 19890.99823 | 1 | |
| 19890.88615 | 1 | |
| 19890.52346 | 1 | |
| 19890.49973 | 1 | |
| 19890.49881 | 1 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| Timestamp | Open | High | Low | Close | Volume_(BTC) | Volume_(Currency) | Weighted_Price | |
|---|---|---|---|---|---|---|---|---|
| 0 | 1417411980 | 300.0 | 300.0 | 300.0 | 300.0 | 0.01 | 3.0 | 300.0 |
| 1 | 1417412040 | 300.0 | 300.0 | 300.0 | 300.0 | 0.01 | 3.0 | 300.0 |
| 2 | 1417412100 | 300.0 | 300.0 | 300.0 | 300.0 | 0.01 | 3.0 | 300.0 |
| 3 | 1417412160 | 300.0 | 300.0 | 300.0 | 300.0 | 0.01 | 3.0 | 300.0 |
| 4 | 1417412220 | 300.0 | 300.0 | 300.0 | 300.0 | 0.01 | 3.0 | 300.0 |
| 5 | 1417412280 | 300.0 | 300.0 | 300.0 | 300.0 | 0.01 | 3.0 | 300.0 |
| 6 | 1417412340 | 300.0 | 300.0 | 300.0 | 300.0 | 0.01 | 3.0 | 300.0 |
| 7 | 1417412400 | 300.0 | 300.0 | 300.0 | 300.0 | 0.01 | 3.0 | 300.0 |
| 8 | 1417412460 | 300.0 | 300.0 | 300.0 | 300.0 | 0.01 | 3.0 | 300.0 |
| 9 | 1417412520 | 300.0 | 300.0 | 300.0 | 300.0 | 0.01 | 3.0 | 300.0 |
Last rows
| Timestamp | Open | High | Low | Close | Volume_(BTC) | Volume_(Currency) | Weighted_Price | |
|---|---|---|---|---|---|---|---|---|
| 1574264 | 1515369060 | 16221.01 | 16221.01 | 16200.69 | 16221.00 | 2.366366 | 38368.605933 | 16214.144086 |
| 1574265 | 1515369120 | 16221.01 | 16221.01 | 16172.21 | 16174.22 | 19.626989 | 317758.570910 | 16189.878976 |
| 1574266 | 1515369180 | 16174.22 | 16174.22 | 16174.21 | 16174.21 | 7.481674 | 121010.227010 | 16174.217319 |
| 1574267 | 1515369240 | 16174.22 | 16174.22 | 16174.21 | 16174.22 | 7.421392 | 120035.176120 | 16174.213396 |
| 1574268 | 1515369300 | 16174.21 | 16174.22 | 16174.21 | 16174.21 | 3.030103 | 49009.542468 | 16174.218650 |
| 1574269 | 1515369360 | 16174.21 | 16174.23 | 16174.21 | 16174.23 | 7.594119 | 122828.956770 | 16174.221301 |
| 1574270 | 1515369420 | 16174.23 | 16174.23 | 16174.21 | 16174.22 | 11.902468 | 192513.150940 | 16174.221081 |
| 1574271 | 1515369480 | 16174.22 | 16174.22 | 16174.21 | 16174.21 | 3.860840 | 62446.073684 | 16174.218136 |
| 1574272 | 1515369540 | 16174.22 | 16174.22 | 16174.21 | 16174.22 | 1.179093 | 19070.914509 | 16174.219514 |
| 1574273 | 1515369600 | 16174.22 | 16174.23 | 16174.22 | 16174.22 | 5.401224 | 87360.593222 | 16174.220219 |